Dataset statistics
| Number of variables | 30 |
|---|---|
| Number of observations | 2260701 |
| Missing cells | 4735231 |
| Missing cells (%) | 7.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.8 GiB |
| Average record size in memory | 1.3 KiB |
Variable types
| CAT | 16 |
|---|---|
| NUM | 11 |
| URL | 1 |
| BOOL | 1 |
| UNSUPPORTED | 1 |
Reproduction
| Analysis started | 2020-10-29 15:36:03.361973 |
|---|---|
| Analysis finished | 2020-10-29 15:53:58.508170 |
| Version | pandas-profiling v2.6.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
id has a high cardinality: 2260701 distinct values | High cardinality |
emp_title has a high cardinality: 512694 distinct values | High cardinality |
issue_d has a high cardinality: 139 distinct values | High cardinality |
desc has a high cardinality: 124501 distinct values | High cardinality |
title has a high cardinality: 63155 distinct values | High cardinality |
zip_code has a high cardinality: 956 distinct values | High cardinality |
addr_state has a high cardinality: 51 distinct values | High cardinality |
earliest_cr_line has a high cardinality: 754 distinct values | High cardinality |
funded_amnt is highly correlated with loan_amnt and 2 other fields | High Correlation |
loan_amnt is highly correlated with funded_amnt and 2 other fields | High Correlation |
funded_amnt_inv is highly correlated with loan_amnt and 2 other fields | High Correlation |
installment is highly correlated with loan_amnt and 2 other fields | High Correlation |
fico_range_high is highly correlated with fico_range_low | High Correlation |
fico_range_low is highly correlated with fico_range_high | High Correlation |
sub_grade is highly correlated with grade | High Correlation |
grade is highly correlated with sub_grade | High Correlation |
member_id has 2260701 (100.0%) missing values | Missing |
emp_title has 167002 (7.4%) missing values | Missing |
emp_length has 146940 (6.5%) missing values | Missing |
desc has 2134634 (94.4%) missing values | Missing |
title has 23358 (1.0%) missing values | Missing |
annual_inc is highly skewed (γ1 = 493.8860884) | Skewed |
dti is highly skewed (γ1 = 29.20185447) | Skewed |
member_id is an unsupported type, check if it needs cleaning or further analysis | Rejected |
delinq_2yrs has 1839108 (81.4%) zeros | Zeros |
inq_last_6mths has 1381722 (61.1%) zeros | Zeros |
| Distinct count | 2260701 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.2 MiB |
| 1706248 | 1 |
|---|---|
| 136433415 | 1 |
| 18204415 | 1 |
| 69752402 | 1 |
| 139649583 | 1 |
| Other values (2260696) |
| Value | Count | Frequency (%) | |
| 1706248 | 1 | < 0.1% | |
| 136433415 | 1 | < 0.1% | |
| 18204415 | 1 | < 0.1% | |
| 69752402 | 1 | < 0.1% | |
| 139649583 | 1 | < 0.1% | |
| 141412540 | 1 | < 0.1% | |
| 84404486 | 1 | < 0.1% | |
| 84594351 | 1 | < 0.1% | |
| 95209919 | 1 | < 0.1% | |
| 74626314 | 1 | < 0.1% | |
| Other values (2260691) | 2260691 | > 99.9% |
Length
| Max length | 48 |
|---|---|
| Mean length | 8.265717138 |
| Min length | 5 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 17 | 54.8% | |
| Decimal_Number | 10 | 32.3% | |
| Uppercase_Letter | 2 | 6.5% | |
| Space_Separator | 1 | 3.2% | |
| Other_Punctuation | 1 | 3.2% |
| Value | Count | Frequency (%) | |
| Latin | 19 | 61.3% | |
| Common | 12 | 38.7% |
| Value | Count | Frequency (%) | |
| ASCII | 31 | 100.0% |
| Distinct count | 1572 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15046.93123 |
|---|---|
| Minimum | 500 |
| Maximum | 40000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 3250 |
| Q1 | 8000 |
| median | 12900 |
| Q3 | 20000 |
| 95-th percentile | 35000 |
| Maximum | 40000 |
| Range | 39500 |
| Interquartile range (IQR) | 12000 |
Descriptive statistics
| Standard deviation | 9190.245488 |
|---|---|
| Coefficient of variation (CV) | 0.6107720803 |
| Kurtosis | -0.1194391577 |
| Mean | 15046.93123 |
| Median Absolute Deviation (MAD) | 7445.307472 |
| Skewness | 0.7777823287 |
| Sum | 3.401611592e+10 |
| Variance | 84460612.13 |
| Value | Count | Frequency (%) | |
| 10000 | 187236 | 8.3% | |
| 20000 | 131006 | 5.8% | |
| 15000 | 123226 | 5.5% | |
| 12000 | 121681 | 5.4% | |
| 35000 | 86285 | 3.8% | |
| 5000 | 84765 | 3.7% | |
| 8000 | 75033 | 3.3% | |
| 6000 | 72089 | 3.2% | |
| 25000 | 66453 | 2.9% | |
| 16000 | 66418 | 2.9% | |
| Other values (1562) | 1246476 | 55.1% |
| Value | Count | Frequency (%) | |
| 500 | 11 | < 0.1% | |
| 550 | 1 | < 0.1% | |
| 600 | 6 | < 0.1% | |
| 700 | 3 | < 0.1% | |
| 725 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 40000 | 33368 | 1.5% | |
| 39975 | 11 | < 0.1% | |
| 39950 | 10 | < 0.1% | |
| 39925 | 14 | < 0.1% | |
| 39900 | 24 | < 0.1% |
| Distinct count | 1572 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15041.66406 |
|---|---|
| Minimum | 500 |
| Maximum | 40000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 3250 |
| Q1 | 8000 |
| median | 12875 |
| Q3 | 20000 |
| 95-th percentile | 35000 |
| Maximum | 40000 |
| Range | 39500 |
| Interquartile range (IQR) | 12000 |
Descriptive statistics
| Standard deviation | 9188.413022 |
|---|---|
| Coefficient of variation (CV) | 0.6108641296 |
| Kurtosis | -0.1170090387 |
| Mean | 15041.66406 |
| Median Absolute Deviation (MAD) | 7442.540455 |
| Skewness | 0.7787785936 |
| Sum | 3.40042086e+10 |
| Variance | 84426933.87 |
| Value | Count | Frequency (%) | |
| 10000 | 187146 | 8.3% | |
| 20000 | 130816 | 5.8% | |
| 15000 | 123110 | 5.4% | |
| 12000 | 121588 | 5.4% | |
| 35000 | 86147 | 3.8% | |
| 5000 | 84751 | 3.7% | |
| 8000 | 75020 | 3.3% | |
| 6000 | 72075 | 3.2% | |
| 16000 | 66331 | 2.9% | |
| 25000 | 66176 | 2.9% | |
| Other values (1562) | 1247508 | 55.2% |
| Value | Count | Frequency (%) | |
| 500 | 11 | < 0.1% | |
| 550 | 1 | < 0.1% | |
| 600 | 6 | < 0.1% | |
| 700 | 3 | < 0.1% | |
| 725 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 40000 | 33368 | 1.5% | |
| 39975 | 11 | < 0.1% | |
| 39950 | 10 | < 0.1% | |
| 39925 | 14 | < 0.1% | |
| 39900 | 24 | < 0.1% |
| Distinct count | 10057 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15023.43775 |
|---|---|
| Minimum | 0 |
| Maximum | 40000 |
| Zeros | 233 |
| Zeros (%) | < 0.1% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3200 |
| Q1 | 8000 |
| median | 12800 |
| Q3 | 20000 |
| 95-th percentile | 35000 |
| Maximum | 40000 |
| Range | 40000 |
| Interquartile range (IQR) | 12000 |
Descriptive statistics
| Standard deviation | 9192.331679 |
|---|---|
| Coefficient of variation (CV) | 0.6118660612 |
| Kurtosis | -0.1166814573 |
| Mean | 15023.43775 |
| Median Absolute Deviation (MAD) | 7444.677901 |
| Skewness | 0.7782542751 |
| Sum | 3.396300496e+10 |
| Variance | 84498961.69 |
| Value | Count | Frequency (%) | |
| 10000 | 177561 | 7.9% | |
| 20000 | 120453 | 5.3% | |
| 15000 | 114539 | 5.1% | |
| 12000 | 114068 | 5.0% | |
| 5000 | 81999 | 3.6% | |
| 35000 | 76093 | 3.4% | |
| 8000 | 71528 | 3.2% | |
| 6000 | 69475 | 3.1% | |
| 16000 | 61840 | 2.7% | |
| 25000 | 60610 | 2.7% | |
| Other values (10047) | 1312502 | 58.1% |
| Value | Count | Frequency (%) | |
| 0 | 233 | < 0.1% | |
| 0.000121098108 | 1 | < 0.1% | |
| 0.000185369401 | 1 | < 0.1% | |
| 0.000242055511 | 1 | < 0.1% | |
| 0.000531133069 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 40000 | 31767 | 1.4% | |
| 39975 | 616 | < 0.1% | |
| 39950 | 218 | < 0.1% | |
| 39925 | 58 | < 0.1% | |
| 39900 | 29 | < 0.1% |
term
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| 36 months | |
|---|---|
| 60 months |
| Value | Count | Frequency (%) | |
| 36 months | 1609754 | 71.2% | |
| 60 months | 650914 | 28.8% | |
| (Missing) | 33 | < 0.1% |
Length
| Max length | 10 |
|---|---|
| Mean length | 9.999897819 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 7 | 63.6% | |
| Decimal_Number | 3 | 27.3% | |
| Space_Separator | 1 | 9.1% |
| Value | Count | Frequency (%) | |
| Latin | 7 | 63.6% | |
| Common | 4 | 36.4% |
| Value | Count | Frequency (%) | |
| ASCII | 11 | 100.0% |
int_rate
Real number (ℝ≥0)
| Distinct count | 673 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.09282911 |
|---|---|
| Minimum | 5.31 |
| Maximum | 30.99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 5.31 |
|---|---|
| 5-th percentile | 6.49 |
| Q1 | 9.49 |
| median | 12.62 |
| Q3 | 15.99 |
| 95-th percentile | 22.15 |
| Maximum | 30.99 |
| Range | 25.68 |
| Interquartile range (IQR) | 6.5 |
Descriptive statistics
| Standard deviation | 4.832138365 |
|---|---|
| Coefficient of variation (CV) | 0.36906755 |
| Kurtosis | 0.5940245697 |
| Mean | 13.09282911 |
| Median Absolute Deviation (MAD) | 3.799125792 |
| Skewness | 0.7680705625 |
| Sum | 29598539.81 |
| Variance | 23.34956117 |
| Value | Count | Frequency (%) | |
| 11.99 | 53869 | 2.4% | |
| 5.32 | 47171 | 2.1% | |
| 10.99 | 44165 | 2.0% | |
| 13.99 | 43025 | 1.9% | |
| 11.49 | 32010 | 1.4% | |
| 16.99 | 30564 | 1.4% | |
| 12.99 | 29276 | 1.3% | |
| 7.89 | 28514 | 1.3% | |
| 9.17 | 27835 | 1.2% | |
| 15.61 | 25208 | 1.1% | |
| Other values (663) | 1899031 | 84.0% |
| Value | Count | Frequency (%) | |
| 5.31 | 8613 | 0.4% | |
| 5.32 | 47171 | 2.1% | |
| 5.42 | 573 | < 0.1% | |
| 5.79 | 410 | < 0.1% | |
| 5.93 | 1812 | 0.1% |
| Value | Count | Frequency (%) | |
| 30.99 | 819 | < 0.1% | |
| 30.94 | 733 | < 0.1% | |
| 30.89 | 699 | < 0.1% | |
| 30.84 | 755 | < 0.1% | |
| 30.79 | 1572 | 0.1% |
| Distinct count | 93301 |
|---|---|
| Unique (%) | 4.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 445.8068229 |
|---|---|
| Minimum | 4.93 |
| Maximum | 1719.83 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 4.93 |
|---|---|
| 5-th percentile | 110.43 |
| Q1 | 251.65 |
| median | 377.99 |
| Q3 | 593.32 |
| 95-th percentile | 984.47 |
| Maximum | 1719.83 |
| Range | 1714.9 |
| Interquartile range (IQR) | 341.67 |
Descriptive statistics
| Standard deviation | 267.1735346 |
|---|---|
| Coefficient of variation (CV) | 0.5993033774 |
| Kurtosis | 0.6898790426 |
| Mean | 445.8068229 |
| Median Absolute Deviation (MAD) | 211.60717 |
| Skewness | 1.001780569 |
| Sum | 1007821219 |
| Variance | 71381.6976 |
| Value | Count | Frequency (%) | |
| 301.15 | 4420 | 0.2% | |
| 332.1 | 4153 | 0.2% | |
| 361.38 | 3704 | 0.2% | |
| 327.34 | 3353 | 0.1% | |
| 602.3 | 3095 | 0.1% | |
| 451.73 | 3076 | 0.1% | |
| 329.72 | 2614 | 0.1% | |
| 166.05 | 2508 | 0.1% | |
| 498.15 | 2410 | 0.1% | |
| 180.69 | 2364 | 0.1% | |
| Other values (93291) | 2228971 | 98.6% |
| Value | Count | Frequency (%) | |
| 4.93 | 1 | < 0.1% | |
| 7.61 | 1 | < 0.1% | |
| 14.01 | 1 | < 0.1% | |
| 14.77 | 1 | < 0.1% | |
| 15.67 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1719.83 | 2 | < 0.1% | |
| 1717.63 | 1 | < 0.1% | |
| 1715.42 | 2 | < 0.1% | |
| 1714.54 | 6 | < 0.1% | |
| 1691.28 | 2 | < 0.1% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| B | |
|---|---|
| C | |
| A | |
| D | |
| E | |
| Other values (2) | 53968 |
| Value | Count | Frequency (%) | |
| B | 663557 | 29.4% | |
| C | 650053 | 28.8% | |
| A | 433027 | 19.2% | |
| D | 324424 | 14.4% | |
| E | 135639 | 6.0% | |
| F | 41800 | 1.8% | |
| G | 12168 | 0.5% | |
| (Missing) | 33 | < 0.1% |
Length
| Max length | 3 |
|---|---|
| Mean length | 1.000029194 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 7 | 77.8% | |
| Lowercase_Letter | 2 | 22.2% |
| Value | Count | Frequency (%) | |
| Latin | 9 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 9 | 100.0% |
| Distinct count | 35 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| C1 | 145903 |
|---|---|
| B5 | 140288 |
| B4 | 139793 |
| B3 | 131514 |
| C2 | 131116 |
| Other values (30) |
| Value | Count | Frequency (%) | |
| C1 | 145903 | 6.5% | |
| B5 | 140288 | 6.2% | |
| B4 | 139793 | 6.2% | |
| B3 | 131514 | 5.8% | |
| C2 | 131116 | 5.8% | |
| C3 | 129193 | 5.7% | |
| C4 | 127115 | 5.6% | |
| B2 | 126621 | 5.6% | |
| B1 | 125341 | 5.5% | |
| C5 | 116726 | 5.2% | |
| Other values (25) | 947058 | 41.9% |
Length
| Max length | 3 |
|---|---|
| Mean length | 2.000014597 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 7 | 50.0% | |
| Decimal_Number | 5 | 35.7% | |
| Lowercase_Letter | 2 | 14.3% |
| Value | Count | Frequency (%) | |
| Latin | 9 | 64.3% | |
| Common | 5 | 35.7% |
| Value | Count | Frequency (%) | |
| ASCII | 14 | 100.0% |
| Distinct count | 512694 |
|---|---|
| Unique (%) | 24.5% |
| Missing | 167002 |
| Missing (%) | 7.4% |
| Memory size | 17.2 MiB |
| Teacher | 38824 |
|---|---|
| Manager | 34298 |
| Owner | 21977 |
| Registered Nurse | 15867 |
| Driver | 14753 |
| Other values (512689) |
| Value | Count | Frequency (%) | |
| Teacher | 38824 | 1.7% | |
| Manager | 34298 | 1.5% | |
| Owner | 21977 | 1.0% | |
| Registered Nurse | 15867 | 0.7% | |
| Driver | 14753 | 0.7% | |
| RN | 14737 | 0.7% | |
| Supervisor | 14297 | 0.6% | |
| Sales | 13050 | 0.6% | |
| Project Manager | 10971 | 0.5% | |
| Office Manager | 9772 | 0.4% | |
| Other values (512684) | 1905153 | 84.3% | |
| (Missing) | 167002 | 7.4% |
Length
| Max length | 78 |
|---|---|
| Mean length | 14.66562938 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 53 | 31.2% | |
| Uppercase_Letter | 35 | 20.6% | |
| Other_Punctuation | 18 | 10.6% | |
| Control | 13 | 7.6% | |
| Decimal_Number | 10 | 5.9% | |
| Math_Symbol | 7 | 4.1% | |
| Other_Symbol | 5 | 2.9% | |
| Format | 4 | 2.4% | |
| Modifier_Symbol | 3 | 1.8% | |
| Dash_Punctuation | 3 | 1.8% | |
| Other values (9) | 19 | 11.2% |
| Value | Count | Frequency (%) | |
| Common | 81 | 47.6% | |
| Latin | 76 | 44.7% | |
| Cyrillic | 6 | 3.5% | |
| Armenian | 3 | 1.8% | |
| Inherited | 2 | 1.2% | |
| Greek | 2 | 1.2% |
| Value | Count | Frequency (%) | |
| ASCII | 94 | 79.7% | |
| Punctuation | 9 | 7.6% | |
| Cyrillic | 6 | 5.1% | |
| Armenian | 3 | 2.5% | |
| IPA Ext | 2 | 1.7% | |
| Letterlike Symbols | 1 | 0.8% | |
| Math Operators | 1 | 0.8% | |
| Misc Symbols | 1 | 0.8% | |
| VS | 1 | 0.8% |
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 146940 |
| Missing (%) | 6.5% |
| Memory size | 17.2 MiB |
| 10+ years | |
|---|---|
| 2 years | |
| < 1 year | |
| 3 years | |
| 1 year | 148403 |
| Other values (6) |
| Value | Count | Frequency (%) | |
| 10+ years | 748005 | 33.1% | |
| 2 years | 203677 | 9.0% | |
| < 1 year | 189988 | 8.4% | |
| 3 years | 180753 | 8.0% | |
| 1 year | 148403 | 6.6% | |
| 5 years | 139698 | 6.2% | |
| 4 years | 136605 | 6.0% | |
| 6 years | 102628 | 4.5% | |
| 7 years | 92695 | 4.1% | |
| 8 years | 91914 | 4.1% | |
| (Missing) | 146940 | 6.5% |
Length
| Max length | 9 |
|---|---|
| Mean length | 7.420150652 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 52.6% | |
| Lowercase_Letter | 6 | 31.6% | |
| Math_Symbol | 2 | 10.5% | |
| Space_Separator | 1 | 5.3% |
| Value | Count | Frequency (%) | |
| Common | 13 | 68.4% | |
| Latin | 6 | 31.6% |
| Value | Count | Frequency (%) | |
| ASCII | 19 | 100.0% |
home_ownership
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| MORTGAGE | |
|---|---|
| RENT | |
| OWN | |
| ANY | 996 |
| OTHER | 182 |
| Value | Count | Frequency (%) | |
| MORTGAGE | 1111450 | 49.2% | |
| RENT | 894929 | 39.6% | |
| OWN | 253057 | 11.2% | |
| ANY | 996 | < 0.1% | |
| OTHER | 182 | < 0.1% | |
| NONE | 54 | < 0.1% | |
| (Missing) | 33 | < 0.1% |
Length
| Max length | 8 |
|---|---|
| Mean length | 5.854246094 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 11 | 84.6% | |
| Lowercase_Letter | 2 | 15.4% |
| Value | Count | Frequency (%) | |
| Latin | 13 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 13 | 100.0% |
| Distinct count | 89368 |
|---|---|
| Unique (%) | 4.0% |
| Missing | 37 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77992.42869 |
|---|---|
| Minimum | 0 |
| Maximum | 110000000 |
| Zeros | 1667 |
| Zeros (%) | 0.1% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 27600 |
| Q1 | 46000 |
| median | 65000 |
| Q3 | 93000 |
| 95-th percentile | 160000 |
| Maximum | 110000000 |
| Range | 110000000 |
| Interquartile range (IQR) | 47000 |
Descriptive statistics
| Standard deviation | 112696.1996 |
|---|---|
| Coefficient of variation (CV) | 1.444963331 |
| Kurtosis | 439001.6589 |
| Mean | 77992.42869 |
| Median Absolute Deviation (MAD) | 34549.45122 |
| Skewness | 493.8860884 |
| Sum | 1.763146758e+11 |
| Variance | 1.27004334e+10 |
| Value | Count | Frequency (%) | |
| 60000 | 87189 | 3.9% | |
| 50000 | 76355 | 3.4% | |
| 65000 | 64903 | 2.9% | |
| 70000 | 62078 | 2.7% | |
| 80000 | 59833 | 2.6% | |
| 40000 | 59684 | 2.6% | |
| 75000 | 58459 | 2.6% | |
| 45000 | 54534 | 2.4% | |
| 55000 | 51583 | 2.3% | |
| 100000 | 46977 | 2.1% | |
| Other values (89358) | 1639069 | 72.5% |
| Value | Count | Frequency (%) | |
| 0 | 1667 | 0.1% | |
| 0.36 | 1 | < 0.1% | |
| 1 | 42 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 110000000 | 1 | < 0.1% | |
| 61000000 | 1 | < 0.1% | |
| 10999200 | 1 | < 0.1% | |
| 9930475 | 1 | < 0.1% | |
| 9757200 | 1 | < 0.1% |
verification_status
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| Source Verified | |
|---|---|
| Not Verified | |
| Verified |
| Value | Count | Frequency (%) | |
| Source Verified | 886231 | 39.2% | |
| Not Verified | 744806 | 32.9% | |
| Verified | 629631 | 27.9% | |
| (Missing) | 33 | < 0.1% |
Length
| Max length | 15 |
|---|---|
| Mean length | 12.06187107 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 11 | 73.3% | |
| Uppercase_Letter | 3 | 20.0% | |
| Space_Separator | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| Latin | 14 | 93.3% | |
| Common | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| ASCII | 15 | 100.0% |
| Distinct count | 139 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| Mar-2016 | 61992 |
|---|---|
| Oct-2015 | 48631 |
| May-2018 | 46311 |
| Oct-2018 | 46305 |
| Aug-2018 | 46079 |
| Other values (134) |
| Value | Count | Frequency (%) | |
| Mar-2016 | 61992 | 2.7% | |
| Oct-2015 | 48631 | 2.2% | |
| May-2018 | 46311 | 2.0% | |
| Oct-2018 | 46305 | 2.0% | |
| Aug-2018 | 46079 | 2.0% | |
| Jul-2015 | 45962 | 2.0% | |
| Dec-2015 | 44343 | 2.0% | |
| Aug-2017 | 43573 | 1.9% | |
| Jul-2018 | 43089 | 1.9% | |
| Apr-2018 | 42928 | 1.9% | |
| Other values (129) | 1791455 | 79.2% |
Length
| Max length | 8 |
|---|---|
| Mean length | 7.999927014 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 14 | 42.4% | |
| Decimal_Number | 10 | 30.3% | |
| Uppercase_Letter | 8 | 24.2% | |
| Dash_Punctuation | 1 | 3.0% |
| Value | Count | Frequency (%) | |
| Latin | 22 | 66.7% | |
| Common | 11 | 33.3% |
| Value | Count | Frequency (%) | |
| ASCII | 33 | 100.0% |
loan_status
Categorical
| Distinct count | 9 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| Fully Paid | |
|---|---|
| Current | |
| Charged Off | |
| Late (31-120 days) | 21467 |
| In Grace Period | 8436 |
| Other values (4) | 7138 |
| Value | Count | Frequency (%) | |
| Fully Paid | 1076751 | 47.6% | |
| Current | 878317 | 38.9% | |
| Charged Off | 268559 | 11.9% | |
| Late (31-120 days) | 21467 | 0.9% | |
| In Grace Period | 8436 | 0.4% | |
| Late (16-30 days) | 4349 | 0.2% | |
| Does not meet the credit policy. Status:Fully Paid | 1988 | 0.1% | |
| Does not meet the credit policy. Status:Charged Off | 761 | < 0.1% | |
| Default | 40 | < 0.1% | |
| (Missing) | 33 | < 0.1% |
Length
| Max length | 51 |
|---|---|
| Mean length | 9.110159636 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 18 | 47.4% | |
| Uppercase_Letter | 9 | 23.7% | |
| Decimal_Number | 5 | 13.2% | |
| Other_Punctuation | 2 | 5.3% | |
| Dash_Punctuation | 1 | 2.6% | |
| Open_Punctuation | 1 | 2.6% | |
| Space_Separator | 1 | 2.6% | |
| Close_Punctuation | 1 | 2.6% |
| Value | Count | Frequency (%) | |
| Latin | 27 | 71.1% | |
| Common | 11 | 28.9% |
| Value | Count | Frequency (%) | |
| ASCII | 38 | 100.0% |
pymnt_plan
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| n | |
|---|---|
| y | 620 |
| (Missing) | 33 |
| Value | Count | Frequency (%) | |
| n | 2260048 | > 99.9% | |
| y | 620 | < 0.1% | |
| (Missing) | 33 | < 0.1% |
url
URL
| Distinct count | 2260668 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| https://lendingclub.com/browse/loanDetail.action?loan_id=38635095 | 1 |
|---|---|
| https://lendingclub.com/browse/loanDetail.action?loan_id=91041952 | 1 |
| https://lendingclub.com/browse/loanDetail.action?loan_id=136220591 | 1 |
| https://lendingclub.com/browse/loanDetail.action?loan_id=51877925 | 1 |
| https://lendingclub.com/browse/loanDetail.action?loan_id=126099280 | 1 |
| Other values (2260663) | |
| (Missing) | 33 |
| Value | Count | Frequency (%) | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=38635095 | 1 | < 0.1% | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=91041952 | 1 | < 0.1% | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=136220591 | 1 | < 0.1% | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=51877925 | 1 | < 0.1% | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=126099280 | 1 | < 0.1% | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=143652747 | 1 | < 0.1% | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=68577858 | 1 | < 0.1% | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=11686311 | 1 | < 0.1% | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=26760102 | 1 | < 0.1% | |
| https://lendingclub.com/browse/loanDetail.action?loan_id=136103905 | 1 | < 0.1% | |
| Other values (2260658) | 2260658 | > 99.9% | |
| (Missing) | 33 | < 0.1% |
| Value | Count | Frequency (%) | |
| https | 2260668 | > 99.9% | |
| (Missing) | 33 | < 0.1% |
| Value | Count | Frequency (%) | |
| lendingclub.com | 2260668 | > 99.9% | |
| (Missing) | 33 | < 0.1% |
| Value | Count | Frequency (%) | |
| /browse/loanDetail.action | 2260668 | > 99.9% | |
| (Missing) | 33 | < 0.1% |
| Value | Count | Frequency (%) | |
| loan_id=70553619 | 1 | < 0.1% | |
| loan_id=141192501 | 1 | < 0.1% | |
| loan_id=26329678 | 1 | < 0.1% | |
| loan_id=19687013 | 1 | < 0.1% | |
| loan_id=104201158 | 1 | < 0.1% | |
| loan_id=72226814 | 1 | < 0.1% | |
| loan_id=445130 | 1 | < 0.1% | |
| loan_id=3707004 | 1 | < 0.1% | |
| loan_id=15350226 | 1 | < 0.1% | |
| loan_id=90087750 | 1 | < 0.1% | |
| Other values (2260658) | 2260658 | > 99.9% | |
| (Missing) | 33 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2260668 | > 99.9% | ||
| (Missing) | 33 | < 0.1% |
| Distinct count | 124501 |
|---|---|
| Unique (%) | 98.8% |
| Missing | 2134634 |
| Missing (%) | 94.4% |
| Memory size | 17.2 MiB |
| 252 | |
| Debt Consolidation | 13 |
| Borrower added on 03/17/14 > Debt consolidation<br> | 11 |
| Borrower added on 03/10/14 > Debt consolidation<br> | 10 |
| Borrower added on 02/19/14 > Debt consolidation<br> | 9 |
| Other values (124496) |
| Value | Count | Frequency (%) | |
| 252 | < 0.1% | ||
| Debt Consolidation | 13 | < 0.1% | |
| Borrower added on 03/17/14 > Debt consolidation<br> | 11 | < 0.1% | |
| Borrower added on 03/10/14 > Debt consolidation<br> | 10 | < 0.1% | |
| Borrower added on 02/19/14 > Debt consolidation<br> | 9 | < 0.1% | |
| Borrower added on 01/29/14 > Debt consolidation<br> | 8 | < 0.1% | |
| Camping Membership | 8 | < 0.1% | |
| Borrower added on 01/15/14 > Debt consolidation<br> | 7 | < 0.1% | |
| Borrower added on 01/22/14 > Debt consolidation<br> | 7 | < 0.1% | |
| Borrower added on 02/26/14 > Debt Consolidation<br> | 6 | < 0.1% | |
| Other values (124491) | 125736 | 5.6% | |
| (Missing) | 2134634 | 94.4% |
Length
| Max length | 3988 |
|---|---|
| Mean length | 16.21424859 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 31 | 21.8% | |
| Lowercase_Letter | 28 | 19.7% | |
| Other_Punctuation | 20 | 14.1% | |
| Control | 19 | 13.4% | |
| Decimal_Number | 10 | 7.0% | |
| Math_Symbol | 7 | 4.9% | |
| Other_Symbol | 4 | 2.8% | |
| Modifier_Symbol | 3 | 2.1% | |
| Dash_Punctuation | 3 | 2.1% | |
| Open_Punctuation | 3 | 2.1% | |
| Other values (7) | 14 | 9.9% |
| Value | Count | Frequency (%) | |
| Common | 83 | 58.5% | |
| Latin | 59 | 41.5% |
| Value | Count | Frequency (%) | |
| ASCII | 96 | 91.4% | |
| Punctuation | 8 | 7.6% | |
| Specials | 1 | 1.0% |
purpose
Categorical
| Distinct count | 14 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| debt_consolidation | |
|---|---|
| credit_card | |
| home_improvement | 150457 |
| other | 139440 |
| major_purchase | 50445 |
| Other values (9) | 125478 |
| Value | Count | Frequency (%) | |
| debt_consolidation | 1277877 | 56.5% | |
| credit_card | 516971 | 22.9% | |
| home_improvement | 150457 | 6.7% | |
| other | 139440 | 6.2% | |
| major_purchase | 50445 | 2.2% | |
| medical | 27488 | 1.2% | |
| small_business | 24689 | 1.1% | |
| car | 24013 | 1.1% | |
| vacation | 15525 | 0.7% | |
| moving | 15403 | 0.7% | |
| Other values (4) | 18360 | 0.8% |
Length
| Max length | 18 |
|---|---|
| Mean length | 14.7923038 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 21 | 95.5% | |
| Connector_Punctuation | 1 | 4.5% |
| Value | Count | Frequency (%) | |
| Latin | 21 | 95.5% | |
| Common | 1 | 4.5% |
| Value | Count | Frequency (%) | |
| ASCII | 22 | 100.0% |
| Distinct count | 63155 |
|---|---|
| Unique (%) | 2.8% |
| Missing | 23358 |
| Missing (%) | 1.0% |
| Memory size | 17.2 MiB |
| Debt consolidation | |
|---|---|
| Credit card refinancing | |
| Home improvement | 137437 |
| Other | 127714 |
| Major purchase | 44840 |
| Other values (63150) |
| Value | Count | Frequency (%) | |
| Debt consolidation | 1153293 | 51.0% | |
| Credit card refinancing | 469691 | 20.8% | |
| Home improvement | 137437 | 6.1% | |
| Other | 127714 | 5.6% | |
| Major purchase | 44840 | 2.0% | |
| Medical expenses | 25388 | 1.1% | |
| Business | 20804 | 0.9% | |
| Car financing | 20526 | 0.9% | |
| Debt Consolidation | 15763 | 0.7% | |
| Vacation | 14443 | 0.6% | |
| Other values (63145) | 207444 | 9.2% | |
| (Missing) | 23358 | 1.0% |
Length
| Max length | 80 |
|---|---|
| Mean length | 17.53985998 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 28 | 25.9% | |
| Lowercase_Letter | 28 | 25.9% | |
| Other_Punctuation | 15 | 13.9% | |
| Decimal_Number | 10 | 9.3% | |
| Control | 8 | 7.4% | |
| Math_Symbol | 6 | 5.6% | |
| Modifier_Symbol | 3 | 2.8% | |
| Close_Punctuation | 2 | 1.9% | |
| Open_Punctuation | 2 | 1.9% | |
| Other_Symbol | 1 | 0.9% | |
| Other values (5) | 5 | 4.6% |
| Value | Count | Frequency (%) | |
| Latin | 56 | 51.9% | |
| Common | 52 | 48.1% |
| Value | Count | Frequency (%) | |
| ASCII | 94 | 100.0% |
| Distinct count | 956 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 34 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| 112xx | 23908 |
|---|---|
| 945xx | 23782 |
| 750xx | 23649 |
| 606xx | 21192 |
| 300xx | 20497 |
| Other values (951) |
| Value | Count | Frequency (%) | |
| 112xx | 23908 | 1.1% | |
| 945xx | 23782 | 1.1% | |
| 750xx | 23649 | 1.0% | |
| 606xx | 21192 | 0.9% | |
| 300xx | 20497 | 0.9% | |
| 331xx | 19051 | 0.8% | |
| 070xx | 18316 | 0.8% | |
| 770xx | 17719 | 0.8% | |
| 891xx | 17162 | 0.8% | |
| 100xx | 17103 | 0.8% | |
| Other values (946) | 2058288 | 91.0% |
Length
| Max length | 5 |
|---|---|
| Mean length | 4.999969921 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 76.9% | |
| Lowercase_Letter | 3 | 23.1% |
| Value | Count | Frequency (%) | |
| Common | 10 | 76.9% | |
| Latin | 3 | 23.1% |
| Value | Count | Frequency (%) | |
| ASCII | 13 | 100.0% |
| Distinct count | 51 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| CA | |
|---|---|
| NY | 186389 |
| TX | 186335 |
| FL | 161991 |
| IL | 91173 |
| Other values (46) |
| Value | Count | Frequency (%) | |
| CA | 314533 | 13.9% | |
| NY | 186389 | 8.2% | |
| TX | 186335 | 8.2% | |
| FL | 161991 | 7.2% | |
| IL | 91173 | 4.0% | |
| NJ | 83132 | 3.7% | |
| PA | 76939 | 3.4% | |
| OH | 75132 | 3.3% | |
| GA | 74196 | 3.3% | |
| VA | 62954 | 2.8% | |
| Other values (41) | 947894 | 41.9% |
Length
| Max length | 3 |
|---|---|
| Mean length | 2.000014597 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 24 | 92.3% | |
| Lowercase_Letter | 2 | 7.7% |
| Value | Count | Frequency (%) | |
| Latin | 26 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 26 | 100.0% |
| Distinct count | 10845 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 1744 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.82419644 |
|---|---|
| Minimum | -1 |
| Maximum | 999 |
| Zeros | 1732 |
| Zeros (%) | 0.1% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 4.94 |
| Q1 | 11.89 |
| median | 17.84 |
| Q3 | 24.49 |
| 95-th percentile | 33.88 |
| Maximum | 999 |
| Range | 1000 |
| Interquartile range (IQR) | 12.6 |
Descriptive statistics
| Standard deviation | 14.18332854 |
|---|---|
| Coefficient of variation (CV) | 0.7534626294 |
| Kurtosis | 1755.261278 |
| Mean | 18.82419644 |
| Median Absolute Deviation (MAD) | 7.567932162 |
| Skewness | 29.20185447 |
| Sum | 42523050.31 |
| Variance | 201.1668086 |
| Value | Count | Frequency (%) | |
| 0 | 1732 | 0.1% | |
| 18 | 1584 | 0.1% | |
| 14.4 | 1577 | 0.1% | |
| 16.8 | 1576 | 0.1% | |
| 19.2 | 1566 | 0.1% | |
| 15.6 | 1506 | 0.1% | |
| 13.2 | 1496 | 0.1% | |
| 12 | 1486 | 0.1% | |
| 20.4 | 1424 | 0.1% | |
| 21.6 | 1391 | 0.1% | |
| Other values (10835) | 2243619 | 99.2% | |
| (Missing) | 1744 | 0.1% |
| Value | Count | Frequency (%) | |
| -1 | 2 | < 0.1% | |
| 0 | 1732 | 0.1% | |
| 0.01 | 22 | < 0.1% | |
| 0.02 | 35 | < 0.1% | |
| 0.03 | 19 | < 0.1% |
| Value | Count | Frequency (%) | |
| 999 | 135 | < 0.1% | |
| 995.6 | 1 | < 0.1% | |
| 995.17 | 1 | < 0.1% | |
| 994.4 | 1 | < 0.1% | |
| 991.57 | 1 | < 0.1% |
| Distinct count | 37 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 62 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3068791612 |
|---|---|
| Minimum | 0 |
| Maximum | 58 |
| Zeros | 1839108 |
| Zeros (%) | 81.4% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 58 |
| Range | 58 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8672303329 |
|---|---|
| Coefficient of variation (CV) | 2.825966839 |
| Kurtosis | 73.35208962 |
| Mean | 0.3068791612 |
| Median Absolute Deviation (MAD) | 0.4993136191 |
| Skewness | 5.929811375 |
| Sum | 693743 |
| Variance | 0.7520884503 |
| Value | Count | Frequency (%) | |
| 0 | 1839108 | 81.4% | |
| 1 | 281353 | 12.4% | |
| 2 | 81289 | 3.6% | |
| 3 | 29542 | 1.3% | |
| 4 | 13179 | 0.6% | |
| 5 | 6599 | 0.3% | |
| 6 | 3717 | 0.2% | |
| 7 | 2062 | 0.1% | |
| 8 | 1223 | 0.1% | |
| 9 | 818 | < 0.1% | |
| Other values (27) | 1749 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1839108 | 81.4% | |
| 1 | 281353 | 12.4% | |
| 2 | 81289 | 3.6% | |
| 3 | 29542 | 1.3% | |
| 4 | 13179 | 0.6% |
| Value | Count | Frequency (%) | |
| 58 | 1 | < 0.1% | |
| 42 | 1 | < 0.1% | |
| 39 | 1 | < 0.1% | |
| 36 | 1 | < 0.1% | |
| 35 | 1 | < 0.1% |
| Distinct count | 754 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 62 |
| Missing (%) | < 0.1% |
| Memory size | 17.2 MiB |
| Sep-2004 | 15400 |
|---|---|
| Sep-2003 | 15215 |
| Sep-2005 | 14780 |
| Aug-2003 | 14669 |
| Aug-2004 | 14413 |
| Other values (749) |
| Value | Count | Frequency (%) | |
| Sep-2004 | 15400 | 0.7% | |
| Sep-2003 | 15215 | 0.7% | |
| Sep-2005 | 14780 | 0.7% | |
| Aug-2003 | 14669 | 0.6% | |
| Aug-2004 | 14413 | 0.6% | |
| Aug-2001 | 14355 | 0.6% | |
| Aug-2002 | 14322 | 0.6% | |
| Aug-2005 | 14207 | 0.6% | |
| Aug-2006 | 14143 | 0.6% | |
| Oct-2003 | 14108 | 0.6% | |
| Other values (744) | 2115027 | 93.6% |
Length
| Max length | 8 |
|---|---|
| Mean length | 7.999862874 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 14 | 42.4% | |
| Decimal_Number | 10 | 30.3% | |
| Uppercase_Letter | 8 | 24.2% | |
| Dash_Punctuation | 1 | 3.0% |
| Value | Count | Frequency (%) | |
| Latin | 22 | 66.7% | |
| Common | 11 | 33.3% |
| Value | Count | Frequency (%) | |
| ASCII | 33 | 100.0% |
| Distinct count | 48 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 698.5882049 |
|---|---|
| Minimum | 610 |
| Maximum | 845 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 610 |
|---|---|
| 5-th percentile | 660 |
| Q1 | 675 |
| median | 690 |
| Q3 | 715 |
| 95-th percentile | 765 |
| Maximum | 845 |
| Range | 235 |
| Interquartile range (IQR) | 40 |
Descriptive statistics
| Standard deviation | 33.01037645 |
|---|---|
| Coefficient of variation (CV) | 0.04725298283 |
| Kurtosis | 1.315168369 |
| Mean | 698.5882049 |
| Median Absolute Deviation (MAD) | 25.83583873 |
| Skewness | 1.192877206 |
| Sum | 1579276000 |
| Variance | 1089.684954 |
| Value | Count | Frequency (%) | |
| 660 | 186580 | 8.3% | |
| 670 | 182119 | 8.1% | |
| 665 | 180759 | 8.0% | |
| 680 | 167199 | 7.4% | |
| 675 | 164016 | 7.3% | |
| 685 | 148009 | 6.5% | |
| 690 | 144690 | 6.4% | |
| 695 | 130941 | 5.8% | |
| 700 | 124184 | 5.5% | |
| 705 | 113074 | 5.0% | |
| Other values (38) | 719097 | 31.8% |
| Value | Count | Frequency (%) | |
| 610 | 2 | < 0.1% | |
| 615 | 1 | < 0.1% | |
| 620 | 1 | < 0.1% | |
| 625 | 2 | < 0.1% | |
| 630 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 845 | 441 | < 0.1% | |
| 840 | 572 | < 0.1% | |
| 835 | 859 | < 0.1% | |
| 830 | 1439 | 0.1% | |
| 825 | 2189 | 0.1% |
| Distinct count | 48 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 702.5884 |
|---|---|
| Minimum | 614 |
| Maximum | 850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 614 |
|---|---|
| 5-th percentile | 664 |
| Q1 | 679 |
| median | 694 |
| Q3 | 719 |
| 95-th percentile | 769 |
| Maximum | 850 |
| Range | 236 |
| Interquartile range (IQR) | 40 |
Descriptive statistics
| Standard deviation | 33.01124462 |
|---|---|
| Coefficient of variation (CV) | 0.0469851831 |
| Kurtosis | 1.316769731 |
| Mean | 702.5884 |
| Median Absolute Deviation (MAD) | 25.83606391 |
| Skewness | 1.193116484 |
| Sum | 1588319113 |
| Variance | 1089.742271 |
| Value | Count | Frequency (%) | |
| 664 | 186580 | 8.3% | |
| 674 | 182119 | 8.1% | |
| 669 | 180759 | 8.0% | |
| 684 | 167199 | 7.4% | |
| 679 | 164016 | 7.3% | |
| 689 | 148009 | 6.5% | |
| 694 | 144690 | 6.4% | |
| 699 | 130941 | 5.8% | |
| 704 | 124184 | 5.5% | |
| 709 | 113074 | 5.0% | |
| Other values (38) | 719097 | 31.8% |
| Value | Count | Frequency (%) | |
| 614 | 2 | < 0.1% | |
| 619 | 1 | < 0.1% | |
| 624 | 1 | < 0.1% | |
| 629 | 2 | < 0.1% | |
| 634 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 850 | 441 | < 0.1% | |
| 844 | 572 | < 0.1% | |
| 839 | 859 | < 0.1% | |
| 834 | 1439 | 0.1% | |
| 829 | 2189 | 0.1% |
| Distinct count | 28 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 63 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5768353889 |
|---|---|
| Minimum | 0 |
| Maximum | 33 |
| Zeros | 1381722 |
| Zeros (%) | 61.1% |
| Memory size | 17.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 33 |
| Range | 33 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8859631583 |
|---|---|
| Coefficient of variation (CV) | 1.53590292 |
| Kurtosis | 9.57608952 |
| Mean | 0.5768353889 |
| Median Absolute Deviation (MAD) | 0.7051338138 |
| Skewness | 2.066186683 |
| Sum | 1304016 |
| Variance | 0.784930718 |
| Value | Count | Frequency (%) | |
| 0 | 1381722 | 61.1% | |
| 1 | 584390 | 25.8% | |
| 2 | 200212 | 8.9% | |
| 3 | 69009 | 3.1% | |
| 4 | 17380 | 0.8% | |
| 5 | 6232 | 0.3% | |
| 6 | 1231 | 0.1% | |
| 7 | 195 | < 0.1% | |
| 8 | 122 | < 0.1% | |
| 9 | 50 | < 0.1% | |
| Other values (18) | 95 | < 0.1% | |
| (Missing) | 63 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1381722 | 61.1% | |
| 1 | 584390 | 25.8% | |
| 2 | 200212 | 8.9% | |
| 3 | 69009 | 3.1% | |
| 4 | 17380 | 0.8% |
| Value | Count | Frequency (%) | |
| 33 | 1 | < 0.1% | |
| 32 | 1 | < 0.1% | |
| 31 | 1 | < 0.1% | |
| 28 | 1 | < 0.1% | |
| 27 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| id | member_id | loan_amnt | funded_amnt | funded_amnt_inv | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | pymnt_plan | url | desc | purpose | title | zip_code | addr_state | dti | delinq_2yrs | earliest_cr_line | fico_range_low | fico_range_high | inq_last_6mths | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 68407277 | NaN | 3600.0 | 3600.0 | 3600.0 | 36 months | 13.99 | 123.03 | C | C4 | leadman | 10+ years | MORTGAGE | 55000.0 | Not Verified | Dec-2015 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=68407277 | NaN | debt_consolidation | Debt consolidation | 190xx | PA | 5.91 | 0.0 | Aug-2003 | 675.0 | 679.0 | 1.0 |
| 1 | 68355089 | NaN | 24700.0 | 24700.0 | 24700.0 | 36 months | 11.99 | 820.28 | C | C1 | Engineer | 10+ years | MORTGAGE | 65000.0 | Not Verified | Dec-2015 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=68355089 | NaN | small_business | Business | 577xx | SD | 16.06 | 1.0 | Dec-1999 | 715.0 | 719.0 | 4.0 |
| 2 | 68341763 | NaN | 20000.0 | 20000.0 | 20000.0 | 60 months | 10.78 | 432.66 | B | B4 | truck driver | 10+ years | MORTGAGE | 63000.0 | Not Verified | Dec-2015 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=68341763 | NaN | home_improvement | NaN | 605xx | IL | 10.78 | 0.0 | Aug-2000 | 695.0 | 699.0 | 0.0 |
| 3 | 66310712 | NaN | 35000.0 | 35000.0 | 35000.0 | 60 months | 14.85 | 829.90 | C | C5 | Information Systems Officer | 10+ years | MORTGAGE | 110000.0 | Source Verified | Dec-2015 | Current | n | https://lendingclub.com/browse/loanDetail.action?loan_id=66310712 | NaN | debt_consolidation | Debt consolidation | 076xx | NJ | 17.06 | 0.0 | Sep-2008 | 785.0 | 789.0 | 0.0 |
| 4 | 68476807 | NaN | 10400.0 | 10400.0 | 10400.0 | 60 months | 22.45 | 289.91 | F | F1 | Contract Specialist | 3 years | MORTGAGE | 104433.0 | Source Verified | Dec-2015 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=68476807 | NaN | major_purchase | Major purchase | 174xx | PA | 25.37 | 1.0 | Jun-1998 | 695.0 | 699.0 | 3.0 |
| 5 | 68426831 | NaN | 11950.0 | 11950.0 | 11950.0 | 36 months | 13.44 | 405.18 | C | C3 | Veterinary Tecnician | 4 years | RENT | 34000.0 | Source Verified | Dec-2015 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=68426831 | NaN | debt_consolidation | Debt consolidation | 300xx | GA | 10.20 | 0.0 | Oct-1987 | 690.0 | 694.0 | 0.0 |
| 6 | 68476668 | NaN | 20000.0 | 20000.0 | 20000.0 | 36 months | 9.17 | 637.58 | B | B2 | Vice President of Recruiting Operations | 10+ years | MORTGAGE | 180000.0 | Not Verified | Dec-2015 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=68476668 | NaN | debt_consolidation | Debt consolidation | 550xx | MN | 14.67 | 0.0 | Jun-1990 | 680.0 | 684.0 | 0.0 |
| 7 | 67275481 | NaN | 20000.0 | 20000.0 | 20000.0 | 36 months | 8.49 | 631.26 | B | B1 | road driver | 10+ years | MORTGAGE | 85000.0 | Not Verified | Dec-2015 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=67275481 | NaN | major_purchase | Major purchase | 293xx | SC | 17.61 | 1.0 | Feb-1999 | 705.0 | 709.0 | 0.0 |
| 8 | 68466926 | NaN | 10000.0 | 10000.0 | 10000.0 | 36 months | 6.49 | 306.45 | A | A2 | SERVICE MANAGER | 6 years | RENT | 85000.0 | Not Verified | Dec-2015 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=68466926 | NaN | credit_card | Credit card refinancing | 160xx | PA | 13.07 | 0.0 | Apr-2002 | 685.0 | 689.0 | 1.0 |
| 9 | 68616873 | NaN | 8000.0 | 8000.0 | 8000.0 | 36 months | 11.48 | 263.74 | B | B5 | Vendor liaison | 10+ years | MORTGAGE | 42000.0 | Not Verified | Dec-2015 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=68616873 | NaN | credit_card | Credit card refinancing | 029xx | RI | 34.80 | 0.0 | Nov-1994 | 700.0 | 704.0 | 0.0 |
Last rows
| id | member_id | loan_amnt | funded_amnt | funded_amnt_inv | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | pymnt_plan | url | desc | purpose | title | zip_code | addr_state | dti | delinq_2yrs | earliest_cr_line | fico_range_low | fico_range_high | inq_last_6mths | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2260691 | 89996426 | NaN | 32000.0 | 32000.0 | 32000.0 | 60 months | 14.49 | 752.74 | C | C4 | Sales Manager | 3 years | MORTGAGE | 157000.0 | Source Verified | Oct-2016 | Charged Off | n | https://lendingclub.com/browse/loanDetail.action?loan_id=89996426 | NaN | home_improvement | Home improvement | 863xx | AZ | 10.34 | 0.0 | Jun-2011 | 735.0 | 739.0 | 0.0 |
| 2260692 | 90006534 | NaN | 16000.0 | 16000.0 | 16000.0 | 60 months | 12.79 | 362.34 | C | C1 | Manager | 10+ years | RENT | 150000.0 | Not Verified | Oct-2016 | Fully Paid | n | https://lendingclub.com/browse/loanDetail.action?loan_id=90006534 | NaN | medical | Medical expenses | 284xx | NC | 12.25 | 0.0 | Aug-1997 | 665.0 | 669.0 | 0.0 |
| 2260693 | 89955820 | NaN | 24000.0 | 24000.0 | 24000.0 | 60 months | 10.49 | 515.74 | B | B3 | Current Operations Officer | 4 years | OWN | 125000.0 | Not Verified | Oct-2016 | Current | n | https://lendingclub.com/browse/loanDetail.action?loan_id=89955820 | NaN | credit_card | Credit card refinancing | 967xx | HI | 10.98 | 0.0 | Feb-2001 | 725.0 | 729.0 | 0.0 |
| 2260694 | 89885898 | NaN | 24000.0 | 24000.0 | 24000.0 | 60 months | 12.79 | 543.50 | C | C1 | Unit Operator | 7 years | MORTGAGE | 95000.0 | Source Verified | Oct-2016 | Current | n | https://lendingclub.com/browse/loanDetail.action?loan_id=89885898 | NaN | home_improvement | Home improvement | 356xx | AL | 19.61 | 0.0 | Dec-1999 | 665.0 | 669.0 | 0.0 |
| 2260695 | 88977788 | NaN | 24000.0 | 24000.0 | 24000.0 | 60 months | 10.49 | 515.74 | B | B3 | Database Administrator | 10+ years | MORTGAGE | 108000.0 | Not Verified | Oct-2016 | Current | n | https://lendingclub.com/browse/loanDetail.action?loan_id=88977788 | NaN | debt_consolidation | Debt consolidation | 840xx | UT | 34.94 | 0.0 | Feb-1991 | 695.0 | 699.0 | 1.0 |
| 2260696 | 88985880 | NaN | 40000.0 | 40000.0 | 40000.0 | 60 months | 10.49 | 859.56 | B | B3 | Vice President | 9 years | MORTGAGE | 227000.0 | Verified | Oct-2016 | Current | n | https://lendingclub.com/browse/loanDetail.action?loan_id=88985880 | NaN | debt_consolidation | NaN | 907xx | CA | 12.75 | 7.0 | Feb-1995 | 705.0 | 709.0 | 1.0 |
| 2260697 | 88224441 | NaN | 24000.0 | 24000.0 | 24000.0 | 60 months | 14.49 | 564.56 | C | C4 | Program Manager | 6 years | RENT | 110000.0 | Not Verified | Oct-2016 | Charged Off | n | https://lendingclub.com/browse/loanDetail.action?loan_id=88224441 | NaN | debt_consolidation | Debt consolidation | 334xx | FL | 18.30 | 0.0 | Jul-1999 | 660.0 | 664.0 | 0.0 |
| 2260698 | 88215728 | NaN | 14000.0 | 14000.0 | 14000.0 | 60 months | 14.49 | 329.33 | C | C4 | Customer Service Technician | 10+ years | MORTGAGE | 95000.0 | Verified | Oct-2016 | Current | n | https://lendingclub.com/browse/loanDetail.action?loan_id=88215728 | NaN | debt_consolidation | NaN | 770xx | TX | 23.36 | 0.0 | Jun-1996 | 660.0 | 664.0 | 1.0 |
| 2260699 | Total amount funded in policy code 1: 1465324575 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2260700 | Total amount funded in policy code 2: 521953170 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |